ABSTRACT

The bounds of Birnbaum (1942) and Sampford (1953) for the upper tail
area of the normal distribution are extended to the upper tail of the
t-distribution for arbitrary real degrees of freedom. This generalizes
the results of Soms (1977a) for integral degrees of freedom. Numerical
and theoretical comparisons are made with the bounds of Peizer and Pratt
(1968), Wallace (1959) and Soms (1977b).
SIGNIFICANCE AND EXPLANATION

Real degrees of freedom arise in the modification of the two-sample
t-test when the variances cannot be assumed to be equal. For small degrees
of freedom both linear interpolation and standard computer routines may be
unsatisfactory. The present paper provides simple techniques for
satisfactory estimates of small tail probabilities.
BOUNDS FOR THE t-TAIL AREA FOR ARBITRARY DEGREES OF FREEDOM

Andrew P. Soms

1. Introduction

The purpose of this paper is to extend to the tail area of the
t-distribution bounds given by Birnbaum (1942) and Sampford (1953)
for the tail area of the normal distribution based on Mills'
ratio. This extension of Mills' ratio type bounds is completed
in Soms (1977b) and (1979) and the basis for these extensions is
the asymptotic expansion obtained in Soms (1976). An excellent
summary of the normal results can be found in Chapter 33 of
Johnson and Kotz (1970). The organization of this paper is as
follows. In Section 2, we introduce the notation and give needed
previous results. The proofs are given in Section 3; in Section
4 theoretical comparisons between the bounds are made; and
Section 5 consists of numerical comparisons.


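2. Statement of Results

Let $\phi(x) = (2\pi)^{-1/2} e^{-x^2/2}$, $\bar F(x) = \int_x^\infty \phi(t)\,dt$, and
$R_x = \bar F(x)/\phi(x)$, $x > 0$. Then Birnbaum (1942) and Sampford (1953)
gave the following lower and upper bounds, respectively, for $R_x$:

$$\left(\frac{x}{2} + \left(1+\frac{x^2}{4}\right)^{1/2}\right)^{-1} < R_x < \left(\frac{3x}{4} + \left(\frac{1}{2}+\frac{x^2}{16}\right)^{1/2}\right)^{-1}. \tag{2.1}$$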
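As an informal numerical check of (2.1), the following sketch evaluates the normal Mills' ratio with the standard-library complementary error function and compares it with the two bounds. The function names and the grid of $x$ values are illustrative choices, not part of the original report.

```python
import math

def mills_ratio(x):
    """R_x = (upper normal tail area) / (normal density) for x > 0."""
    tail = 0.5 * math.erfc(x / math.sqrt(2.0))
    density = math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)
    return tail / density

def birnbaum_lower(x):
    """Lower bound of (2.1)."""
    return 1.0 / (x / 2.0 + math.sqrt(1.0 + x * x / 4.0))

def sampford_upper(x):
    """Upper bound of (2.1)."""
    return 1.0 / (3.0 * x / 4.0 + math.sqrt(0.5 + x * x / 16.0))

# Illustrative grid; any x > 0 can be used.
for x in (0.5, 1.0, 2.0, 4.0):
    lo, r, hi = birnbaum_lower(x), mills_ratio(x), sampford_upper(x)
    print(f"x={x:4.1f}  lower={lo:.6f}  R_x={r:.6f}  upper={hi:.6f}")
```

For very large $x$ both the tail area and the density underflow, so a check of this kind is meaningful only over a moderate range of $x$.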


Now let $f_k(t) = c_k (1+t^2/k)^{-(k+1)/2}$,
$c_k = \Gamma((k+1)/2) / (\Gamma(k/2)(\pi k)^{1/2})$,
$\bar F_k(x) = 1 - F_k(x) = \int_x^\infty f_k(t)\,dt$, and
$R_x = \bar F_k(x) / ((1+x^2/k) f_k(x))$, for
real $k>0$ and real $x>0$ here and throughout the paper. Then we
will show here that
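
$$\left[\frac{(k-1)x}{2k} + \left(1+\left(\frac{(k+1)x}{2k}\right)^2\right)^{1/2}\right]^{-1} < R_x < \left[\frac{(3k-1)x}{4k} + \left(\frac{k-1}{2k}+\left(\frac{(k+1)x}{4k}\right)^2\right)^{1/2}\right]^{-1}, \tag{2.2}$$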

where the lower bound is valid for $k \ge 1$ and the upper for $k \ge 2$.
Note that if, for fixed $x$, $k \to \infty$ in (2.2), we obtain (2.1) with "$\le$"
replacing "$<$". A result from Soms (1976) which will be used
subsequently is that

$$1/x - k/((k+2)x^3) < R_x < 1/x, \tag{2.3}$$

for real $k > 0$ (actually the proof assumed $k$ a positive integer
but this fact was nowhere used).

3. Proofs

Let $v(x) = 1/R_x$. Then

$$\lambda(x) = v'(x) = v(x)\,(v(x) - (k-1)x/k)/(1+x^2/k) \tag{3.1}$$

and the first result is that $0 < \lambda(x) < 1$ for $k \ge 1$ (here and
throughout, "$'$" means the derivative).
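The relation between $v'$ and $v$ can be checked directly from the definitions of Section 2. The following sketch uses the shorthand $D = 1+x^2/k$ and $g = D f_k$, which are introduced here only for brevity.

```latex
\begin{aligned}
g(x) &= c_k D^{-(k-1)/2}, \qquad g'(x) = -\frac{(k-1)x}{k}\, f_k(x), \\
R_x' &= \frac{-f_k\, g - \bar F_k\, g'}{g^2}
      = \frac{1}{D}\left(\frac{(k-1)x}{k}\, R_x - 1\right), \\
v'   &= -\frac{R_x'}{R_x^2}
      = \frac{1}{D}\left(v^2 - \frac{(k-1)x}{k}\, v\right)
      = \frac{v\,\bigl(v - (k-1)x/k\bigr)}{1+x^2/k}.
\end{aligned}
```

The last line is the expression whose boundedness by 1 yields the inequality $v(v-(k-1)x/k) < 1+x^2/k$ used later in this section.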

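Theorem 3.1: Let $\lambda(x)$ be given by (3.1). Then $0 < \lambda(x) < 1$
for $k \ge 1$.

Proof: For convenience, $\lambda(x)$ and $v(x)$ will be denoted by $\lambda$
and $v$. From (2.3) $v > x$, and hence $\lambda > 0$. Suppose $\lambda \ge 1$ for some
$x$. Then there exists an $x$ such that $\lambda \ge 1$ and $\lambda' = 0$, since from
(2.3) $\lim_{x\to\infty} \lambda = 1$ and it will be shown in Lemma 3.1 that
$\lambda(0) = 4c_k^2 < 1$. Now

$$\begin{aligned}
\lambda' &= \left(\frac{v}{1+x^2/k}\right)'\left(v-\frac{k-1}{k}x\right) + \frac{v}{1+x^2/k}\left(v'-\frac{k-1}{k}\right) \\
&= \frac{v}{(1+x^2/k)^2}\left(v-\frac{k+1}{k}x\right)\left(v-\frac{k-1}{k}x\right) + \frac{v}{1+x^2/k}\left(\frac{v\,(v-\frac{k-1}{k}x)}{1+x^2/k}-\frac{k-1}{k}\right) \\
&= \frac{v}{1+x^2/k}\left[\frac{(v-(k-1)x/k)(2v-(k+1)x/k)}{1+x^2/k}-\frac{k-1}{k}\right] \\
&= \frac{v}{1+x^2/k}\left[\left(2-\frac{k+1}{k}\,\frac{x}{v}\right)\lambda-\frac{k-1}{k}\right] \\
&> \frac{v}{1+x^2/k}\,\frac{k-1}{k}\,(\lambda-1) \ \ge\ 0,
\end{aligned}$$

where the strict inequality uses $v > x$ and $\lambda > 0$,
and hence $\lambda' > 0$, a contradiction, giving the conclusion.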
From Theorem 3.1,

$$v\,(v - (k-1)x/k) < 1 + x^2/k. \tag{3.2}$$

Completing the square in (3.2) gives

$$(v - (k-1)x/2k)^2 < 1 + ((k+1)x/2k)^2,$$

or, since $v - (k-1)x/2k > 0$ from (2.3),

$$\left[\frac{(k-1)x}{2k} + \left(1+\left(\frac{(k+1)x}{2k}\right)^2\right)^{1/2}\right]^{-1} < R_x,$$

the desired result.
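Recall that

$$\lambda' = \frac{v}{1+x^2/k}\left[\frac{(v-(k-1)x/k)(2v-(k+1)x/k)}{1+x^2/k}-\frac{k-1}{k}\right] \tag{3.3}$$

$$\phantom{\lambda'} = \frac{v}{1+x^2/k}\left(\phi-\frac{k-1}{k}\right), \qquad \phi = \frac{(v-(k-1)x/k)(2v-(k+1)x/k)}{1+x^2/k}. \tag{3.4}$$

Then the second result, which will give the upper bound in (2.2), is

Theorem 3.2: Let $\lambda'$ and $\phi$ be as in (3.4). Then, for $k \ge 2$,
$\lambda' > 0$ or, equivalently, $\phi > (k-1)/k$.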

Proof: Suppose $\lambda' \le 0$ for some $x$. Then there exists an $x$
such that $\phi \le (k-1)/k$ and $\phi' = 0$, since $\phi(0) = 8c_k^2 > (k-1)/k$ by
Lemma 3.1 and $\lim_{x\to\infty} \phi(x) = (k-1)/k$, since $\lim_{x\to\infty} v/x = 1$ by (2.3).
Now using (3.1),

$$\begin{aligned}
\phi' &= \frac{[(\lambda-(k-1)/k)(2v-(k+1)x/k) + (v-(k-1)x/k)(2\lambda-(k+1)/k)](1+x^2/k) - 2(1+x^2/k)\,x\,\phi/k}{(1+x^2/k)^2} \\
&= \frac{(v-2x/k)\,\phi - ((k-1)/k)(2v-(k+1)x/k) + (v-(k-1)x/k)(2\lambda-(k+1)/k)}{1+x^2/k}.
\end{aligned} \tag{3.5}$$

Suppose $v \le 2x/k$. Then $x < v \le 2x/k$, since $v > x$ by (2.3), and so
$kx < 2x$, which is impossible since $k \ge 2$, and so $v > 2x/k$. Adding
and subtracting $(v-2x/k)(k-1)/k$ in the numerator of (3.5) gives,
after some simple algebra,

$$\phi' = \frac{(v-2x/k)(\phi-(k-1)/k) + 2(v-(k-1)x/k)(\lambda-1)}{1+x^2/k} < 0,$$

since $\phi \le (k-1)/k$ by assumption and $\lambda < 1$ by Theorem 3.1, which
is a contradiction and completes the proof.

From Theorem 3.2,

$$(v-(k-1)x/k)(2v-(k+1)x/k) > (k-1)(1+x^2/k)/k, \tag{3.6}$$

and completing the square in (3.6) gives

$$(v-(3k-1)x/4k)^2 > (8(k-1)/k + ((k+1)x/k)^2)/16, \tag{3.7}$$

and since $v > (3k-1)x/4k$, (3.7) gives, after some algebra,

$$R_x < \left[\frac{(3k-1)x}{4k} + \left(\frac{k-1}{2k}+\left(\frac{(k+1)x}{4k}\right)^2\right)^{1/2}\right]^{-1}, \tag{3.8}$$

which is the remaining part of (2.2).
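The two bounds derived in this section are simple to evaluate. The sketch below codes them as hypothetical helper functions `soms_lower` and `soms_upper` and compares them with $R_x$ in the two cases where the tail area is elementary: $k = 1$, where $R_x = \arctan(1/x)$, and $k = 2$, where $R_x = (x^2+2)^{1/2} - x$. These closed forms follow from the definitions in Section 2 and are used here only as a reference; the upper bound is checked at $k = 2$ only.

```python
import math

def soms_lower(x, k):
    """Lower bound for R_x: [(k-1)x/2k + (1 + ((k+1)x/2k)^2)^(1/2)]^(-1)."""
    a = (k - 1.0) * x / (2.0 * k)
    b = (k + 1.0) * x / (2.0 * k)
    return 1.0 / (a + math.sqrt(1.0 + b * b))

def soms_upper(x, k):
    """Upper bound for R_x, as in (3.8)."""
    a = (3.0 * k - 1.0) * x / (4.0 * k)
    b = (k + 1.0) * x / (4.0 * k)
    return 1.0 / (a + math.sqrt((k - 1.0) / (2.0 * k) + b * b))

def exact_ratio(x, k):
    """Closed-form R_x, coded only for k = 1 and k = 2."""
    if k == 1:
        return math.atan2(1.0, x)                  # arctan(1/x), Cauchy case
    if k == 2:
        return 2.0 / (math.sqrt(x * x + 2.0) + x)  # equals sqrt(x^2 + 2) - x
    raise ValueError("closed form coded only for k = 1 and k = 2")

# Illustrative grid of x values.
for x in (0.5, 1.0, 2.0, 5.0, 10.0):
    print(f"k=1 x={x:5.1f}  SL={soms_lower(x, 1):.6f}  R={exact_ratio(x, 1):.6f}")
    print(f"k=2 x={x:5.1f}  SL={soms_lower(x, 2):.6f}  R={exact_ratio(x, 2):.6f}"
          f"  SU={soms_upper(x, 2):.6f}")
```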

To complete the proofs of Theorems 3.1 and 3.2 we need

Lemma 3.1: $1-4c_k^2 > 0$ for $k>0$ and $8c_k^2 > (k-1)/k$ for $k \ge 2$.

Proof: The proof is an immediate consequence of the fact
shown in Soms (1979) that for $k>0$ the $c_k$ are a strictly increasing
function of $k$ with limit $1/\sqrt{2\pi}$. For the sake of completeness
we provide a brief outline of that proof. Let

$$g(k) = \frac{d}{dk}\ln c_k = \frac{1}{2}\psi\!\left(\frac{k+1}{2}\right) - \frac{1}{2}\psi\!\left(\frac{k}{2}\right) - \frac{1}{2k},$$

where $\psi(x)$ is the digamma function (see,
e.g., Abramowitz and Stegun, 1965, p. 258). From Artin (1964,
p. 17),

$$h(k) = \frac{1}{2}\psi\!\left(\frac{k+1}{2}\right) - \frac{1}{2}\psi\!\left(\frac{k}{2}\right) = \sum_{i=0}^{\infty} \frac{1}{(k+2i)(k+2i+1)}.$$

By the Euler-Maclaurin formula,

$$\frac{1}{2}\ln\frac{k+1}{k} + \frac{1}{2}\,\frac{1}{k(k+1)} < h(k),$$

and so $h(k) > 1/(2k)$, that is, $g(k) > 0$, giving the conclusion.
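The monotonicity of $c_k$ and the two inequalities of Lemma 3.1 can be illustrated numerically with the standard-library log-gamma function. The helper name `c` and the grid of $k$ values below are illustrative assumptions, and the check is of course no substitute for the proof.

```python
import math

def c(k):
    """Normalising constant c_k = Gamma((k+1)/2) / (Gamma(k/2) * sqrt(pi*k))."""
    log_ratio = math.lgamma((k + 1.0) / 2.0) - math.lgamma(k / 2.0)
    return math.exp(log_ratio) / math.sqrt(math.pi * k)

limit = 1.0 / math.sqrt(2.0 * math.pi)   # value of the normal density at 0

# Illustrative, increasing grid of real degrees of freedom.
ks = [0.25, 0.5, 1.0, 1.5, 2.0, 3.7, 10.0, 100.0, 1000.0]
for k in ks:
    ck = c(k)
    line = f"k={k:8.2f}  c_k={ck:.6f}  1-4c_k^2={1.0 - 4.0 * ck * ck:.6f}"
    if k >= 2.0:
        line += f"  8c_k^2-(k-1)/k={8.0 * ck * ck - (k - 1.0) / k:.6f}"
    print(line)
print(f"limit 1/sqrt(2*pi) = {limit:.6f}")
```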

It is reasonable to ask whether consideration of derivatives
of $R_x$ would also lead to bounds. In fact it does lead to
analogues of the bounds of Gordon (1941), but these bounds, just
as in the normal case, are inferior to (2.2). For the details of
these and some other second-best bounds the reader is referred to
Soms (1977a).


4. Theoretical Comparisons

We give here the three types of other bounds with which
numerical comparisons, theoretical comparisons, or both will be made.
The bounds of Peizer and Pratt (1968) are

$$\frac{1}{x}\,\frac{x^2(k+2)}{x^2(k+2)+k} < R_x < \frac{1}{x}\,\frac{(k+2)(k+4)x^2+2(k+1)k}{(k+2)(k+4)x^2+2(k+1)k+k(k+4)}, \tag{4.1}$$

the bounds of Wallace (1959)

$$\frac{1-\Phi[(k\ln(1+x^2/k))^{1/2}]}{(1+x^2/k)\,f_k(x)} < R_x < \frac{1-\Phi[(1-1/2k)^{1/2}(k\ln(1+x^2/k))^{1/2}]}{(1+x^2/k)\,f_k(x)}, \tag{4.2}$$

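$k > 1/2$ for the upper bound, and of Soms (1977b), given as (4.3) in terms of

$$p(x,\gamma) = \frac{1+\gamma}{(x^2+4c_k^2(1+\gamma)^2)^{1/2}+\gamma x}$$

and two constants $\gamma_{\min}$ and $\gamma_{\max}$ depending on $k$ and $c_k$
[display (4.3) and the definitions of $\gamma_{\min}$ and $\gamma_{\max}$ for $k>2$ are illegible in the source].
For $k<2$ the definitions of $\gamma_{\min}$ and $\gamma_{\max}$
are interchanged, and for $k=2$, $\gamma_{\min} = \gamma_{\max}$ and $R_x = p(x,\gamma_{\max})$.
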


Let us denote the lower bounds of (2.2), (4.1), (4.2), and (4.3)
by SL, PL, WL, and BL, respectively, and the upper bounds by SU,
PU, WU, and BU.

Then, after some algebra, comparison of (2.2) with (4.1)
yields the following results. SL is better (bigger) than PL for
$x < k/(k+2)^{1/2}$. SU is better (smaller) than PU for

$$x < \left[-\frac{b}{2a} + \left(\frac{b^2-4ac}{4a^2}\right)^{1/2}\right]^{1/2},$$

where

$a = -2(k+2)(k+4)^2$,
$b = -2k(k+4)(2k^2+3k+4)$,
and $c = 4(k-1)k^2(k+1)^2$.
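The two crossover points can be checked numerically by evaluating the bounds on either side of each threshold. The sketch below is only an illustration under the formulas for SL, SU, PL and PU given in (2.2) and (4.1); the function names and the test values of $k$ are assumptions made here.

```python
import math

def sl(x, k):
    """Lower bound of (2.2)."""
    a, b = (k - 1.0) * x / (2.0 * k), (k + 1.0) * x / (2.0 * k)
    return 1.0 / (a + math.sqrt(1.0 + b * b))

def su(x, k):
    """Upper bound of (2.2)."""
    a, b = (3.0 * k - 1.0) * x / (4.0 * k), (k + 1.0) * x / (4.0 * k)
    return 1.0 / (a + math.sqrt((k - 1.0) / (2.0 * k) + b * b))

def pl(x, k):
    """Peizer-Pratt lower bound of (4.1)."""
    return x * (k + 2.0) / (x * x * (k + 2.0) + k)

def pu(x, k):
    """Peizer-Pratt upper bound of (4.1)."""
    num = (k + 2.0) * (k + 4.0) * x * x + 2.0 * (k + 1.0) * k
    return num / (x * (num + k * (k + 4.0)))

def sl_pl_crossover(k):
    """SL exceeds PL for x below this value."""
    return k / math.sqrt(k + 2.0)

def su_pu_crossover(k):
    """SU is below PU for x below this value (a, b, c as in the text)."""
    a = -2.0 * (k + 2.0) * (k + 4.0) ** 2
    b = -2.0 * k * (k + 4.0) * (2.0 * k * k + 3.0 * k + 4.0)
    c = 4.0 * (k - 1.0) * k * k * (k + 1.0) ** 2
    return math.sqrt(-b / (2.0 * a) + math.sqrt((b * b - 4.0 * a * c) / (4.0 * a * a)))

for k in (2.0, 3.5, 10.0):   # illustrative degrees of freedom
    print(f"k={k:5.1f}  SL/PL crossover={sl_pl_crossover(k):.4f}"
          f"  SU/PU crossover={su_pu_crossover(k):.4f}")
```

Both thresholds grow with $k$, so the region in which the bounds of (2.2) improve on those of (4.1) is the region of small to moderate $x$.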

Analytical comparisons with Wallace (1959) appear impossible.
BL is better than SL for $k \ge k_0$, where $1 < k_0 < 2$, and for
sufficiently large $x$ and $1 \le k < k_0$, SL is better. For
sufficiently large $x$, SU is better than BU. Further analytical
comparisons here are prohibitively difficult.

We note here that for $k = 1$, we have the Cauchy distribution
and (2.2) and (2.3) combine for the interesting simple and sharp
result

$$\frac{1}{\pi(1+x^2)^{1/2}} < \int_x^\infty \frac{dt}{\pi(1+t^2)} < \frac{1}{\pi x}.$$
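Since the Cauchy tail area is $\arctan(1/x)/\pi$ for $x > 0$, this double inequality is easy to confirm numerically. The short sketch below does so on an illustrative grid; the function name is a choice made here.

```python
import math

def cauchy_tail(x):
    """Upper tail area of the Cauchy distribution, arctan(1/x)/pi, for x > 0."""
    return math.atan2(1.0, x) / math.pi

for x in (0.1, 1.0, 10.0, 1000.0):   # illustrative values
    lower = 1.0 / (math.pi * math.sqrt(1.0 + x * x))
    upper = 1.0 / (math.pi * x)
    print(f"x={x:7.1f}  lower={lower:.8f}  tail={cauchy_tail(x):.8f}  upper={upper:.8f}")
```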


5. Numerical Comparisons

Let $g(x) = (1+x^2/k)\,f_k(x)$. Then (2.2) gives

$$g(x)\,\mathrm{SL} < \bar F_k(x) < g(x)\,\mathrm{SU}, \tag{5.1}$$

and we can approximate $\bar F_k(x)$ by $L = g(x)\,\mathrm{SL}$ and $U = g(x)\,\mathrm{SU}$. This
is done in Table 1 for selected degrees of freedom and known t
percentiles (Cramér, 1946, p. 560).
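A minimal sketch of how $L$ and $U$ might be computed for real degrees of freedom is given below. The reference tail area is obtained by a midpoint rule after the substitution $t = x/w$, which is an assumption of this sketch rather than the method used for the tables; the value $k = 7.3$ is a hypothetical non-integer example of the kind arising in the unequal-variance two-sample test, and the routine is intended only for $k \ge 2$ and $x > 0$.

```python
import math

def t_density(x, k):
    """Density f_k(x) of the t-distribution with real k > 0 degrees of freedom."""
    log_c = (math.lgamma((k + 1.0) / 2.0) - math.lgamma(k / 2.0)
             - 0.5 * math.log(math.pi * k))
    return math.exp(log_c - 0.5 * (k + 1.0) * math.log1p(x * x / k))

def tail_bounds(x, k):
    """Return (L, U) = (g(x)*SL, g(x)*SU) from (5.1)."""
    g = (1.0 + x * x / k) * t_density(x, k)
    b = (k + 1.0) * x
    sl = 1.0 / ((k - 1.0) * x / (2.0 * k) + math.sqrt(1.0 + (b / (2.0 * k)) ** 2))
    su = 1.0 / ((3.0 * k - 1.0) * x / (4.0 * k)
                + math.sqrt((k - 1.0) / (2.0 * k) + (b / (4.0 * k)) ** 2))
    return g * sl, g * su

def tail_numeric(x, k, n=20000):
    """Reference upper tail area: midpoint rule on (0, 1) after t = x / w."""
    h = 1.0 / n
    total = 0.0
    for i in range(n):
        w = (i + 0.5) * h
        total += t_density(x / w, k) * x / (w * w)
    return total * h

# Illustrative inputs: 2 degrees of freedom, and a hypothetical real value 7.3.
for x, k in ((2.920, 2.0), (2.5, 7.3)):
    low, up = tail_bounds(x, k)
    print(f"k={k:4.1f} x={x:5.3f}  L={low:.6f}  numeric={tail_numeric(x, k):.6f}  U={up:.6f}")
```

Either $L$ or $U$, or their average, can then serve as an estimate of a small tail probability, with $U - L$ giving a guaranteed error band.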




From Table 1 it appears that the approximation using the SL
and SU bounds gives good results if $\bar F_k(x)$ is less than or equal
to .1.

In Table 2, SL, PL, WL, and BL are given and in Table 3, SU,
PU, WU, and BU, for selected $x$ and $k$. In the calculation of the
WL bounds for extreme tails the Boyd (1959) approximation for the
tail  area  of  the  normal  distribution  was  used.  A  indicates 

the  bound  is  inapplicable. 


While the superiority of SL to BL for $k = 1$ is not apparent from
the tables, it is seen by direct calculation that, for example,
for $x = 50$ SL is bigger than BL. In general, for lower bounds
BL is best while the picture is mixed for upper bounds. For $x$
larger than in the tables the performance of SL and SU would be
better than for BL and BU.